DESCRIPTION

METHOD OF DEFINING THE DIFFERENTIATION GRADE OF TUMOR 

Technical Field

The present invention relates to a method of defining
the differentiation grade of a tumor. More particularly, the
present invention relates to a method of defining the
differentiation grade of a tumor by selecting genes and/or
proteins whose expression level correlates with each
differentiation grade of hepatocellular carcinoma (HCC) and
measuring the expression of those genes and/or proteins in human
tumor tissues of each differentiation grade. The present
invention also relates to the use of these genes and/or proteins
for diagnosing the differentiation grade of HCC and for
screening anti-cancer agents for HCC treatment.

The present invention also relates to a kit for
performing the method of the present invention, comprising DNA
chips, oligonucleotide chips, protein chips, peptides,
antibodies, probes and primers that are necessary for DNA
microarrays, oligonucleotide microarrays, protein arrays,
northern blotting, in situ hybridization, RNase protection
assays, western blotting, ELISA assays, and reverse transcription
polymerase chain reaction (hereinafter referred to as RT-PCR)
to examine the expression of the genes and/or proteins whose
expression level correlates with the differentiation grade of
the tumor.

Background Art

Cancer is a major cause of death worldwide.
In particular, hepatocellular carcinoma (HCC) is one of the most
common cancers worldwide and represents a major
international health problem because of its increasing
incidence in many countries (Schafer, D.F. and Sorrell, M.F.
Hepatocellular carcinoma, Lancet 353, 1253-1257 (1999),
Colombo, M. Hepatitis C virus and hepatocellular carcinoma,
Semin. Liver Dis. 19, 263-269 (1999), and Okuda, K.
Hepatocellular carcinoma, J. Hepatol. 32, 225-237 (2000)).
Chronic hepatitis C virus (HCV) infection is one of the major
risk factors for HCC, as are hepatitis B virus (HBV)
infection, alcohol consumption, and several carcinogens such
as aflatoxin B1 (Okuda, K. Hepatocellular carcinoma, J. Hepatol.
32, 225-237 (2000)). Several therapies have been adopted for
the treatment of HCC. These include surgical resection,
radiotherapy, chemotherapy, and biological therapy including
hormonal and gene therapy. However, none of these therapies
can cure the disease. One of the major problems of HCC
treatment is that the characteristics of cancer cells change during
the development and progression of the disease. In particular,
changes in the differentiation grade of tumor cells are
apparent and frequent. Such changes alter the ability of tumor
cells to invade and metastasize and also the sensitivity of
cancer cells to different therapies, causing resistance to
anti-cancer agents. If the changes in the characteristics of
cancer cells are precisely diagnosed and managed, cancer
therapy will be more effective.

Previous studies suggested the involvement of tumor
suppressor genes and oncogenes such as the p53, β-catenin, and
AXIN1 genes in hepatocarcinogenesis (Okabe, H., Satoh, S., Kato,
T., Kitahara, O., Yanagawa, R., Yamaoka, Y., Tsunoda, T.,
Furukawa, Y., and Nakamura, Y. Genome-wide analysis of gene
expression in human hepatocellular carcinomas using cDNA
microarray: identification of genes involved in viral
carcinogenesis and tumor progression, Cancer Res. 61,
2129-2137 (2001)). It has also been suggested that the
development of HCV-associated HCC can be characterized by the
pathological evolution from early to advanced tumor, which
correlates with dedifferentiation of cancer cells (Kojiro, M.
Pathological evolution of early hepatocellular carcinoma,
Oncology 62, 43-47 (2002)). Particularly after the introduction
of DNA microarray technologies into medical science (Schena,
M., Shalon, D., Davis, R.W., and Brown, P.O. Quantitative
monitoring of gene expression patterns with a complementary DNA
microarray, Science 270, 467-470 (1995), DeRisi, J., Penland,
L., Brown, P.O., Bittner, M.L., Meltzer, P.S., Ray, M., Chen,
Y., Su, Y.A., and Trent, J.M. Use of a cDNA microarray to analyse
gene expression patterns in human cancer, Nat. Genet. 14,
457-460 (1996)), many studies showed gene-expression patterns
relating to some aspects of HCC (Lau, W.Y., Lai, P.B., Leung,
M.F., Leung, B.C., Wong, N., Chen, G., Leung, T.W., and Liew,
C.T. Differential gene expression of hepatocellular carcinoma
using cDNA microarray analysis, Oncol. Res. 12, 59-69 (2000),
Tackels-Horne, D., Goodman, M.D., Williams, A.J., Wilson, D.J.,
Eskandari, T., Vogt, L.M., Boland, J.F., Scherf, U., and
Vockley, J.G. Identification of differentially expressed genes
in hepatocellular carcinoma and metastatic liver tumors by
oligonucleotide expression profiling, Cancer 92, 395-405
(2001), Xu, L., Hui, L., Wang, S., Gong, J., Jin, Y., Wang, Y.,
Ji, Y., Wu, X., Han, Z., and Hu, G. Expression profiling
suggested a regulatory role of liver-enriched transcription
factors in human hepatocellular carcinoma, Cancer Res. 61,
3176-3181 (2001), Xu, X.R., Huang, J., Xu, Z.G., Qian, B.Z.,
Zhu, Z.D., Yan, Q., Cai, T., Zhang, X., Xiao, H.S., Qu, J., Liu,
F., Huang, Q.H., Cheng, Z.H., Li, N.G., Du, J.J., Hu, W., Shen,
K.T., Lu, G., Fu, G., Zhong, M., Xu, S.H., Gu, W.Y., Huang, W.,
Zhao, X.T., Hu, G.X., Gu, J.R., Chen, Z., and Han, Z.G. Insight
into hepatocellular carcinogenesis at transcriptome level by
comparing gene expression profiles of hepatocellular carcinoma
with those of corresponding non-cancerous liver, Proc. Natl.
Acad. Sci. U.S.A. 98, 15089-15094 (2001), Okabe, H., Satoh, S.,
Kato, T., Kitahara, O., Yanagawa, R., Yamaoka, Y., Tsunoda, T.,
Furukawa, Y., and Nakamura, Y. Genome-wide analysis of gene
expression in human hepatocellular carcinomas using cDNA
microarray: identification of genes involved in viral
carcinogenesis and tumor progression, Cancer Res. 61,
2129-2137 (2001), Shirota, Y., Kaneko, S., Honda, M., Kawai,
H.F., and Kobayashi, K. Identification of differentially
expressed genes in hepatocellular carcinoma with cDNA
microarrays, Hepatology 33, 832-840 (2001), Delpuech, O.,
Trabut, J.B., Carnot, F., Feuillard, J., Brechot, C., and
Kremsdorf, D. Identification, using cDNA macroarray analysis,
of distinct gene expression profiles associated with
pathological and virological features of hepatocellular
carcinoma, Oncogene 21, 2926-2937 (2002), Iizuka, N., Oka, M.,
Yamada-Okabe, H., Mori, N., Tamesa, T., Okada, T., Takemoto,
T., Tangoku, A., Hamada, K., Nakayama, H., Miyamoto, T.,
Uchimura, S., and Hamamoto, Y. Comparison of gene expression
profiles between hepatitis B virus- and hepatitis C
virus-infected hepatocellular carcinoma by oligonucleotide
microarray data based on a supervised learning method, Cancer
Res. 62, 3939-3944 (2002), and Midorikawa, Y., Tsutsumi, S.,
Taniguchi, H., Ishii, M., Kobune, Y., Kodama, T., Makuuchi, M.,
and Aburatani, H. Identification of genes associated with
dedifferentiation of hepatocellular carcinoma with expression
profiling analysis, Jpn. J. Cancer Res. 93, 636-643 (2002)).
Among them, two studies profiled gene expression of HCC in
relation to its development (Okabe, H., Satoh, S., Kato, T.,
Kitahara, O., Yanagawa, R., Yamaoka, Y., Tsunoda, T., Furukawa,
Y., and Nakamura, Y. Genome-wide analysis of gene expression
in human hepatocellular carcinomas using cDNA microarray:
identification of genes involved in viral carcinogenesis and
tumor progression, Cancer Res. 61, 2129-2137 (2001) and
Midorikawa, Y., Tsutsumi, S., Taniguchi, H., Ishii, M., Kobune,
Y., Kodama, T., Makuuchi, M., and Aburatani, H. Identification
of genes associated with dedifferentiation of hepatocellular
carcinoma with expression profiling analysis, Jpn. J. Cancer
Res. 93, 636-643 (2002)). However, nothing is known about genes
and/or proteins that characterize and/or regulate each
differentiation grade of HCC during the course of oncogenesis
and development of HCV-associated HCC. Genes and/or proteins
that regulate the differentiation grade of HCC can be used for
diagnosing the differentiation grade of HCC and for screening
anti-cancer agents for the treatment of HCC arising from
chronic HCV infection.

In the present invention, the inventors describe a
method of diagnosing the differentiation grade of a tumor and
screening anti-cancer agents for the treatment thereof.
In particular, the inventors describe a method of identifying
40 or more genes and/or proteins whose expression correlates
with the differentiation grade of HCC, and the use of these genes
and/or proteins for diagnosing the differentiation grade of HCC
and for screening anti-cancer agents for the treatment of HCC
in different grades. More particularly, the inventors describe
a method of predicting non-cancerous liver, pre-cancerous
liver, and each differentiation grade of HCC with 40 genes
and/or proteins.

Disclosure of the Invention
Summary of the Invention

Hepatocellular carcinoma (HCC) is one of the most common
cancers worldwide. However, there is no therapy that can cure
the disease. This is presumably due to sequential changes in
the characteristics of cancer cells during the development and
progression of the disease. In particular, progression of
cancer is often associated with changes in the differentiation
grade of tumor cells. Diagnosis and management of such changes
in cancer cells will make cancer therapy more effective. In
the present invention, genes whose expression correlates with
oncogenesis and development of HCC are identified using an
oligonucleotide microarray representing approximately 11,000
genes, from 50 hepatitis C virus (HCV)-associated HCC tissues
and 11 non-tumorous (non-cancerous and pre-cancerous) liver
tissues.

Differentiation states are divided into 5 grades.
Non-cancerous liver (L0) is liver that is histologically
normal and is seronegative for both hepatitis B virus surface
antigen and HCV antibody. Pre-cancerous liver (L1) is liver
that is HCV-infected and is histopathologically diagnosed as
chronic hepatitis or liver cirrhosis. Well differentiated HCC
(G1) is HCC consisting of cancer cells that are
characterized by an increase in cell density with elevated
nuclear/cytoplasm ratios compared to normal hepatocytes but
show morphologies similar to normal hepatocytes.
Moderately differentiated HCC (G2) is HCC consisting of
cancer cells that are large and hyperchromatic. Trabecular or
gland-like structures are present in the cancer cell nests in the
G2 grade. Poorly differentiated HCC (G3) is HCC consisting
of cancer cells that are pleomorphic or multinucleate. In the
G3 grade, the tumor grows in solid masses or cell nests devoid
of architectural arrangement. G1, G2, and G3 tumors correspond to
types I, II, and III of the Edmondson & Steiner classification,
respectively (Edmondson, H.A. and Steiner, P.E. Primary
carcinoma of the liver: a study of 100 cases among 48,900
necropsies, Cancer 7, 462-504 (1954)).

A supervised learning method followed by a random
permutation test of oligonucleotide microarray data is used to
select genes whose expression significantly changes during the
transition from non-cancerous liver without HCV infection (L0)
to pre-cancerous liver with HCV infection (L1), from L1 to well
differentiated HCC (G1), from G1 to moderately differentiated
HCC (G2), and from G2 to poorly differentiated HCC (G3).
A self-organizing map with all the selected 40 genes whose
expression is significantly altered in each transition stage
can correctly predict the differentiation grade of tumor
tissues. Thus, these genes can be used for diagnosing the
differentiation grade of HCC and for screening anti-cancer
agents for the treatment of HCC in each differentiation grade.

Detailed Description of the Invention 

In the present invention, human hepatocellular
carcinoma (HCC) tissues and non-tumorous (non-cancerous and
pre-cancerous) liver tissues are used. HCCs with HCV infection
are used for analyzing HCCs. The presence of HCV and/or HBV
infection can be determined either by immunoreactivity against
anti-HCV antibody and anti-HBV antibody or by amplifying the HCV
and/or HBV genome by PCR. The differentiation grade of HCC can
be determined by histopathological examination, and HCCs are
classified into well differentiated HCC (G1), moderately
differentiated HCC (G2), and poorly differentiated HCC (G3).
Non-tumorous liver samples can be obtained from patients who
underwent hepatic resection for benign or metastatic liver
tumors. A liver sample without HCV infection is classified as
non-cancerous liver (L0), and one with HCV infection is
classified as pre-cancerous liver (L1). After liver tissues are
resected during surgery, it is preferable that the tissues be
immediately frozen in liquid nitrogen or acetone containing dry
ice and stored at -70 to -80°C until use. The tissues
may or may not be embedded in O.C.T. compound (Sakura-Seiki,
Tokyo, Japan, Catalog No. 4583).

The expression of genes and/or proteins of HCC tissues
and non-tumorous liver tissues can be analyzed by measuring the
level of RNA and/or proteins. In most cases, the level of RNA
and/or proteins is determined by measuring fluorescence from
substances including fluorescein and rhodamine,
chemiluminescence from luminol, radioactivity of radioactive
materials including ³H, ¹⁴C, ³⁵S, ³³P, ³²P, and ¹²⁵I, and optical
density. For example, the expression level of RNA and/or
proteins is determined by known methods including DNA
microarray (Schena, M. et al. Quantitative monitoring of gene
expression patterns with a complementary DNA microarray,
Science 270, 467-470 (1995) and Lipshutz, R.J. et al. High
density synthetic oligonucleotide arrays, Nat. Genet. 21,
20-24 (1999)), RT-PCR (Weis, J.H. et al. Detection of rare mRNAs
via quantitative RT-PCR, Trends Genet. 8, 263-264 (1992) and
Bustin, S.A. Absolute quantification of mRNA using real-time
reverse transcription polymerase chain reaction assays, J. Mol.
Endocrinol. 25, 169-193 (2000)), northern blotting and in situ
hybridization (Parker, R.M. and Barnes, N.M. mRNA: detection
in situ and northern hybridization, Methods Mol. Biol. 106,
247-283 (1999)), RNase protection assay (Hod, Y. A simplified
ribonuclease protection assay, BioTechniques 13, 852-854
(1992) and Saccomanno, C.F. et al. A faster ribonuclease
protection assay, BioTechniques 13, 846-850 (1992)), western
blotting (Towbin, H. et al. Electrophoretic transfer of
proteins from polyacrylamide gels to nitrocellulose sheets,
Proc. Natl. Acad. Sci. U.S.A. 76, 4350-4354 (1979) and Burnette,
W.N. Western blotting: Electrophoretic transfer of proteins
from sodium dodecyl sulfate-polyacrylamide gels to unmodified
nitrocellulose and radioiodinated protein A, Anal. Biochem.
112, 195-203 (1981)), ELISA assay (Engvall, E. and Perlman, P.
Enzyme-linked immunosorbent assay (ELISA): Quantitative assay
of immunoglobulin G, Immunochemistry 8, 871-879 (1971)), and
protein array (Merchant, M. and Weinberger, S.R. Review: Recent
advancements in surface-enhanced laser
desorption/ionization-time of flight-mass spectrometry,
Electrophoresis 21, 1164-1177 (2000) and Paweletz, C.P. et al.
Rapid protein display profiling of cancer progression directly
from human tissue using a protein biochip, Drug Dev. Res. 49,
34-42 (2000)).

Genes and/or proteins that are differentially expressed in
each differentiation grade of HCC and non-tumorous
(non-cancerous and pre-cancerous) liver are selected by
comparing the expression level of genes and/or proteins among
HCC tissues in each differentiation grade and non-tumorous
liver tissues. Genes and/or proteins that are differentially
expressed between non-cancerous liver (L0) and pre-cancerous
liver that has been infected with HCV (L1) are identified by
comparing the expression level of each gene and/or protein
between non-cancerous liver tissues and pre-cancerous liver
tissues. Genes and/or proteins that are differentially
expressed between pre-cancerous liver (L1) and well
differentiated HCC (G1) are identified by comparing the
expression level of each gene and/or protein between
pre-cancerous liver tissues and well differentiated HCC
tissues (HCC(G1)). Genes and/or proteins that are
differentially expressed between well differentiated HCC (G1)
and moderately differentiated HCC (G2) are identified by
comparing the expression level of each gene and/or protein
between HCC(G1) and moderately differentiated HCC tissues
(HCC(G2)). Similarly, genes and/or proteins that are
differentially expressed between moderately differentiated
HCC (G2) and poorly differentiated HCC (G3) are identified by
comparing the expression level of each gene and/or protein
between HCC(G2) and poorly differentiated HCC tissues
(HCC(G3)).



Differences in the expression level of genes and/or
proteins of non-cancerous liver, pre-cancerous liver, well
differentiated HCC, moderately differentiated HCC, and poorly
differentiated HCC can be analyzed and detected by known
methods of statistical analysis. In all experiments for
comparing the expression level of genes and/or proteins between
two grades selected from L0, L1, G1, G2, and G3, the following
procedures are followed.

In the first step, genes and/or proteins with a certain
expression level (e.g. genes with an expression level greater than
40 in the arbitrary units of the Affymetrix gene chip
results) in all the HCC samples and in the non-cancerous and
pre-cancerous liver samples are selected. This selection
results in a certain number of genes and/or proteins. Then, the
ability of each gene and/or protein to
discriminate L0 from L1, L1 from G1, G1 from G2, and G2 from
G3 is determined by the Fisher ratio. The Fisher ratio for a
gene j in a comparison between Grades A and B is given by

$$F(j) = \frac{\left(\hat{\mu}_j(A) - \hat{\mu}_j(B)\right)^2}{\hat{\sigma}_j^2(A) + \hat{\sigma}_j^2(B)}$$

where $\hat{\mu}_j(i)$ is the sample mean of the expression level of gene
j for the samples in Grade i, and $\hat{\sigma}_j^2(i)$ is the sample variance
of the expression level of gene j for the samples in Grade i.
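
The first step and the Fisher ratio can be sketched in Python as follows. This is a minimal illustration, assuming the Fisher ratio takes the common two-class form (squared difference of the grade means divided by the sum of the grade variances) and that expression data are held in NumPy arrays with one row per sample and one column per gene. The function names, the array layout, and the toy values are hypothetical and are not taken from the original analysis.

```python
import numpy as np


def filter_expressed(all_samples, threshold=40.0):
    """Return column indices of genes above `threshold` in every sample."""
    all_samples = np.asarray(all_samples, dtype=float)
    return np.flatnonzero((all_samples > threshold).all(axis=0))


def fisher_ratio(expr_a, expr_b):
    """Per-gene Fisher ratio for two grades.

    expr_a, expr_b: arrays of shape (n_samples, n_genes), one per grade.
    """
    expr_a = np.asarray(expr_a, dtype=float)
    expr_b = np.asarray(expr_b, dtype=float)
    mean_diff = expr_a.mean(axis=0) - expr_b.mean(axis=0)
    # ddof=1 gives the unbiased sample variance.
    var_sum = expr_a.var(axis=0, ddof=1) + expr_b.var(axis=0, ddof=1)
    return mean_diff ** 2 / var_sum


# Toy data (hypothetical): 3 samples per grade, 3 genes.
grade_a = np.array([[100.0, 50.0, 30.0],
                    [110.0, 52.0, 45.0],
                    [90.0, 48.0, 60.0]])
grade_b = np.array([[200.0, 51.0, 70.0],
                    [210.0, 49.0, 80.0],
                    [190.0, 50.0, 90.0]])

# The third gene falls to 30 in one sample, so the filter drops it.
kept = filter_expressed(np.vstack([grade_a, grade_b]))
ratios = fisher_ratio(grade_a[:, kept], grade_b[:, kept])
```

In this toy example the first gene separates the two grades strongly and receives a large ratio, while the second gene has equal means in both grades and receives a ratio of zero.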
In the second step, the selected genes and/or proteins
are ranked in order of decreasing magnitude of the Fisher
ratio. A random permutation test is also performed to determine
the number of genes and/or proteins to define the
differentiation grade of HCC. In the permutation test, sample
labels are randomly permuted between the two grades to be compared,
and the Fisher ratio for each gene and/or protein is again
computed. This random permutation of sample labels is repeated
1,000 times. The Fisher ratios generated from the actual data
are assigned P values based on the distribution of the Fisher ratios
from the randomized data. From the distribution of the Fisher
ratios based on the randomized data, the genes and/or proteins
that are determined to be statistically significant between the two
grades by the random permutation test are selected. More
particularly, the genes and/or proteins that have a P value
of less than 0.005 by the random permutation test between the two
grades are selected. Among these selected genes and/or
proteins, the 40 genes and/or proteins having the highest Fisher
ratios in each comparison between non-cancerous liver (L0) and
pre-cancerous liver (L1), pre-cancerous liver (L1) and well
differentiated HCC (G1), well differentiated HCC (G1) and
moderately differentiated HCC (G2), and moderately differentiated
HCC (G2) and poorly differentiated HCC (G3) are further
selected.
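
The second step can be sketched as follows. This is a minimal illustration under stated assumptions: the P value of each gene is estimated as the fraction of label permutations whose Fisher ratio for that same gene is at least as large as the observed one (the text does not say whether the null distribution is built per gene or pooled across genes), and the Fisher ratio has the usual two-class form. All function names and the toy data are hypothetical.

```python
import numpy as np


def fisher_ratio(expr_a, expr_b):
    """Per-gene Fisher ratio; inputs are (n_samples, n_genes) arrays."""
    mean_diff = expr_a.mean(axis=0) - expr_b.mean(axis=0)
    var_sum = expr_a.var(axis=0, ddof=1) + expr_b.var(axis=0, ddof=1)
    return mean_diff ** 2 / var_sum


def permutation_p_values(expr_a, expr_b, n_perm=1000, seed=0):
    """Observed Fisher ratios and their permutation P values."""
    rng = np.random.default_rng(seed)
    pooled = np.vstack([expr_a, expr_b])
    n_a = expr_a.shape[0]
    observed = fisher_ratio(expr_a, expr_b)
    exceed = np.zeros(pooled.shape[1])
    for _ in range(n_perm):
        # Shuffle the sample labels between the two grades.
        order = rng.permutation(pooled.shape[0])
        permuted = fisher_ratio(pooled[order[:n_a]], pooled[order[n_a:]])
        exceed += permuted >= observed
    return observed, exceed / n_perm


def select_genes(expr_a, expr_b, alpha=0.005, top_k=40, n_perm=1000, seed=0):
    """Indices of significant genes, ranked by decreasing Fisher ratio."""
    observed, p_values = permutation_p_values(expr_a, expr_b, n_perm, seed)
    significant = np.flatnonzero(p_values < alpha)
    ranked = significant[np.argsort(-observed[significant])]
    return ranked[:top_k]


# Toy data (hypothetical): 10 samples per grade, 5 genes; only gene 0 differs.
rng = np.random.default_rng(1)
grade_a = rng.normal(0.0, 1.0, size=(10, 5))
grade_b = rng.normal(0.0, 1.0, size=(10, 5))
grade_b[:, 0] += 10.0
selected = select_genes(grade_a, grade_b, top_k=2)
```

With 1,000 permutations the smallest nonzero P value that can be estimated is 0.001, so a threshold of 0.005 corresponds to fewer than five permutations reaching the observed ratio.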



The ability of the selected 40 genes and/or proteins to
distinguish non-cancerous liver (L0) from pre-cancerous liver
(L1), pre-cancerous liver (L1) from well differentiated HCC
(G1), well differentiated HCC (G1) from moderately
differentiated HCC (G2), and moderately differentiated HCC (G2)
from poorly differentiated HCC (G3) is verified by the minimum
distance classifier and the self-organizing map (SOM).

The minimum distance classifier is designed using the 40
genes and/or proteins selected in each transition stage. The
expression level of each gene and/or protein is normalized to
have zero mean and unit variance using all the training samples
from the two grades. After measuring the Euclidean distance
between a sample and each mean vector, the sample is assigned
to the grade of the nearest mean vector. The minimum distance
classifier that is created with the selected 40 genes and/or
proteins in each transition stage is also used to predict the
differentiation grade of HCC samples whose differentiation
grade has not been determined. To diagnose the differentiation grade
of HCCs, using $\hat{\mu}_j(A)$ and $\hat{\mu}_j(B)$ previously described, the
sample mean $\hat{\mu}_j$ of the mixture consisting of Grades A and B on
a gene j is obtained by

$$\hat{\mu}_j = \frac{N_A}{N_A + N_B}\,\hat{\mu}_j(A) + \frac{N_B}{N_A + N_B}\,\hat{\mu}_j(B)$$

where $N_i$ is the number of samples from Grade i. Next, the
sample variance $\hat{\sigma}_j^2$ of the mixture consisting of Grades A and
B on the gene j is obtained by

$$\hat{\sigma}_j^2 = \frac{1}{N_A + N_B - 1}\left[(N_A - 1)\,\hat{\sigma}_j^2(A) + (N_B - 1)\,\hat{\sigma}_j^2(B) + \frac{N_A N_B}{N_A + N_B}\left(\hat{\mu}_j(A) - \hat{\mu}_j(B)\right)^2\right]$$

Using $\hat{\mu}_j$ and $\hat{\sigma}_j^2$, $\hat{\mu}$ and $V$ are defined by

$$\hat{\mu} = \left(\hat{\mu}_1, \hat{\mu}_2, \ldots, \hat{\mu}_{40}\right)^T, \qquad V = \begin{pmatrix} 1/\hat{\sigma}_1 & & 0 \\ & \ddots & \\ 0 & & 1/\hat{\sigma}_{40} \end{pmatrix}$$

Then, a sample $x$ is normalized by

$$\tilde{x} = V^T (x - \hat{\mu})$$
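where $\tilde{x}$ is the normalized sample. Using the normalized
samples, the sample mean vector $\tilde{\mu}_i$ for each grade i is obtained. In
the minimum distance classifier, the score value is computed
by

$$T_1(\tilde{x}) = \|\tilde{x} - \tilde{\mu}_{L0}\|^2 - \|\tilde{x} - \tilde{\mu}_{L1}\|^2$$
$$T_2(\tilde{x}) = \|\tilde{x} - \tilde{\mu}_{L1}\|^2 - \|\tilde{x} - \tilde{\mu}_{G1}\|^2$$
$$T_3(\tilde{x}) = \|\tilde{x} - \tilde{\mu}_{G1}\|^2 - \|\tilde{x} - \tilde{\mu}_{G2}\|^2$$
$$T_4(\tilde{x}) = \|\tilde{x} - \tilde{\mu}_{G2}\|^2 - \|\tilde{x} - \tilde{\mu}_{G3}\|^2$$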
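
The normalization and the score of a single two-grade classifier can be sketched as follows. This is a minimal illustration, assuming the mixture mean and variance are the ordinary mean and unbiased variance of the pooled training samples of the two grades, written here in terms of the per-grade statistics. The function names, the tuple returned by `fit_classifier`, and the toy data are hypothetical.

```python
import numpy as np


def mixture_stats(train_a, train_b):
    """Mixture mean and variance per gene from the per-grade statistics."""
    n_a, n_b = len(train_a), len(train_b)
    mean_a, mean_b = train_a.mean(axis=0), train_b.mean(axis=0)
    var_a = train_a.var(axis=0, ddof=1)
    var_b = train_b.var(axis=0, ddof=1)
    mean = (n_a * mean_a + n_b * mean_b) / (n_a + n_b)
    between = n_a * n_b / (n_a + n_b) * (mean_a - mean_b) ** 2
    var = ((n_a - 1) * var_a + (n_b - 1) * var_b + between) / (n_a + n_b - 1)
    return mean, var


def fit_classifier(train_a, train_b):
    """Return (mean, scale, center_a, center_b) for one transition."""
    mean, var = mixture_stats(train_a, train_b)
    scale = 1.0 / np.sqrt(var)  # diagonal of the scaling matrix
    center_a = ((train_a - mean) * scale).mean(axis=0)
    center_b = ((train_b - mean) * scale).mean(axis=0)
    return mean, scale, center_a, center_b


def score(classifier, sample):
    """Negative when the sample is nearer grade A, positive when nearer B."""
    mean, scale, center_a, center_b = classifier
    z = (np.asarray(sample, dtype=float) - mean) * scale
    return float(np.sum((z - center_a) ** 2) - np.sum((z - center_b) ** 2))


# Toy data (hypothetical): 3 training samples per grade, 2 genes.
train_a = np.array([[0.0, 0.0], [2.0, 2.0], [1.0, 4.0]])
train_b = np.array([[8.0, 10.0], [10.0, 12.0], [9.0, 14.0]])
clf = fit_classifier(train_a, train_b)
```

Because each transition has its own set of 40 genes, a separate classifier of this kind would be fitted for each of the four transitions.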

Using four minimum distance classifiers, the differentiation
grade of HCCs can be diagnosed as follows:

(i) A normalized sample $\tilde{x}$ is classified into Grade L0
if $T_1(\tilde{x}) < 0$, $T_2(\tilde{x}) < 0$, $T_3(\tilde{x}) < 0$ and $T_4(\tilde{x}) < 0$.

(ii) A normalized sample $\tilde{x}$ is classified into Grade L1
if $T_1(\tilde{x}) > 0$, $T_2(\tilde{x}) < 0$, $T_3(\tilde{x}) < 0$ and $T_4(\tilde{x}) < 0$.

(iii) A normalized sample $\tilde{x}$ is classified into Grade G1
if $T_1(\tilde{x}) > 0$, $T_2(\tilde{x}) > 0$, $T_3(\tilde{x}) < 0$ and $T_4(\tilde{x}) < 0$.

(iv) A normalized sample $\tilde{x}$ is classified into Grade G2
if $T_1(\tilde{x}) > 0$, $T_2(\tilde{x}) > 0$, $T_3(\tilde{x}) > 0$ and $T_4(\tilde{x}) < 0$.

(v) A normalized sample $\tilde{x}$ is classified into Grade G3
if $T_1(\tilde{x}) > 0$, $T_2(\tilde{x}) > 0$, $T_3(\tilde{x}) > 0$ and $T_4(\tilde{x}) > 0$.
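
The five rules above amount to counting how many leading scores are positive. A minimal sketch in Python is given below. The text does not say what happens when the sign pattern matches none of the five rules (for example a negative first score followed by a positive second score) or when a score is exactly zero, so the sketch returns `None` in those cases as an assumption. The function name and the constant `GRADES` are hypothetical.

```python
GRADES = ["L0", "L1", "G1", "G2", "G3"]


def assign_grade(scores):
    """Map the four score values (T1, T2, T3, T4) to a grade.

    Returns None when the signs match none of rules (i) to (v).
    """
    if len(scores) != 4 or any(s == 0 for s in scores):
        return None
    positive = [s > 0 for s in scores]
    n_positive = sum(positive)
    # A valid pattern is a run of positive scores followed by negative ones.
    if positive != [True] * n_positive + [False] * (4 - n_positive):
        return None
    return GRADES[n_positive]


example = assign_grade((3.2, 1.5, -0.7, -4.1))
```

Here the first two scores are positive and the last two are negative, which is the pattern of rule (iii).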

The SOM is a neural network algorithm widely used for
clustering and is well known as an efficient tool for the
visualization of multidimensional data (Tamayo, P. et al.
Interpreting patterns of gene expression with self-organizing
maps: methods and application to hematopoietic differentiation,
Proc. Natl. Acad. Sci. U.S.A. 96, 2907-2912 (1999) and Sultan,
M. et al. Binary tree-structured vector quantization approach
to clustering and visualizing microarray data, Bioinformatics
Suppl 1, S111-S119 (2002)). The SOM with all the selected 40
genes and/or proteins is carried out in MATLAB R13 with the
SOM toolbox available at the web site
http://www.cis.hut.fi/projects/somtoolbox/ (Kohonen, 2001).
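
To make the idea of the SOM concrete, the following is a minimal online SOM written in NumPy. It is not the MATLAB SOM toolbox used in the text and does not reproduce its algorithmic details. The grid size, learning rate, neighborhood schedule, function names, and toy data are all hypothetical choices made for illustration only.

```python
import numpy as np


def train_som(data, rows=3, cols=3, n_iter=500, lr0=0.5, radius0=1.5, seed=0):
    """Train a small rectangular SOM; returns (weights, grid coordinates)."""
    rng = np.random.default_rng(seed)
    data = np.asarray(data, dtype=float)
    # Initialize each cell's weight vector from a randomly drawn sample.
    weights = data[rng.integers(0, len(data), rows * cols)].copy()
    grid = np.array([(r, c) for r in range(rows) for c in range(cols)], float)
    for t in range(n_iter):
        frac = t / n_iter
        lr = lr0 * (1.0 - frac)                      # decays toward 0
        radius = radius0 * (1.0 - frac) + 0.5 * frac  # shrinks toward 0.5
        x = data[rng.integers(0, len(data))]
        # Best-matching unit: the cell whose weight vector is nearest to x.
        bmu = np.argmin(((weights - x) ** 2).sum(axis=1))
        grid_dist2 = ((grid - grid[bmu]) ** 2).sum(axis=1)
        neighborhood = np.exp(-grid_dist2 / (2.0 * radius ** 2))
        # Pull the BMU and its grid neighbors toward the sample.
        weights += lr * neighborhood[:, None] * (x - weights)
    return weights, grid


def best_matching_cell(weights, sample):
    """Index of the SOM cell nearest to `sample`."""
    diff = weights - np.asarray(sample, dtype=float)
    return int(np.argmin((diff ** 2).sum(axis=1)))


# Toy data (hypothetical): two well separated groups of 10 samples, 4 genes.
rng = np.random.default_rng(2)
group_low = rng.normal(0.0, 0.1, size=(10, 4))
group_high = rng.normal(10.0, 0.1, size=(10, 4))
weights, grid = train_som(np.vstack([group_low, group_high]))
cell_low = best_matching_cell(weights, group_low[0])
cell_high = best_matching_cell(weights, group_high[0])
```

Samples with similar expression profiles map to the same or neighboring cells, which is the property that lets the map arrange samples in the order of their grades.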

Each set of forty genes and/or proteins whose expression
is significantly altered during the transition from
non-cancerous liver (L0) to pre-cancerous liver (L1), from
pre-cancerous liver (L1) to well differentiated HCC (G1), from
well differentiated HCC (G1) to moderately differentiated HCC
(G2), and from moderately differentiated HCC (G2) to poorly
differentiated HCC (G3) is used for diagnosing the grade of
hepatocarcinogenesis of HCC, and also for screening
anti-cancer agents that are used for the treatment of HCC in
each grade.

Each set of forty genes and/or proteins whose expression
is significantly altered during the transition from
non-cancerous liver (L0) to pre-cancerous liver (L1), from
pre-cancerous liver (L1) to well differentiated HCC (G1), from
well differentiated HCC (G1) to moderately differentiated HCC
(G2), and from moderately differentiated HCC (G2) to poorly
differentiated HCC (G3) is expressed in bacteria, eukaryotic
cells, and cell-free systems. Agents that affect the
expression and/or function of the genes and/or proteins are
screened by monitoring the expression and/or function.
Monoclonal antibodies against the proteins are also raised and
used for treating HCC in different grades. As monoclonal
antibodies, whole mouse monoclonal antibodies, humanized
antibodies, chimeric antibodies, single chain antibodies,
divalent single chain antibodies, and/or bi-specific
antibodies can be raised against the purified proteins, and
they are used for diagnosing the grade of HCC and for the treatment
thereof.

A kit to examine the expression of the genes and/or
proteins is also created. The kit consists of components
including reagents for RNA extraction, enzymes for synthesis
of cDNA and cRNA, DNA chips, oligonucleotide chips, protein
chips, probes and primers for the genes, DNA fragments of
control genes, and antibodies to the proteins. The components
of the kit are readily available commercially.

Brief Description of the Drawings 

Fig. 1 illustrates color displays of the expression of 152 genes
whose expression was significantly altered during the
transition from L0 to L1 (a), 191 genes whose expression was
significantly altered during the transition from L1 to G1 (b),
54 genes whose expression was significantly altered during the
transition from G1 to G2 (c), and 40 genes whose expression was
significantly altered during the transition from G2 to G3 (d).
Panels e, f, g, and h illustrate expression of the selected 40
genes in each transition stage in all the samples. Expression
of the selected 40 genes whose expression was significantly
altered during the transition from L0 to L1 (e), from L1 to G1
(f), from G1 to G2 (g), and from G2 to G3 (h) is shown. The
selected 40 genes in each transition stage discriminate samples
before and after the transition. Genes are shown in decreasing
order of the Fisher ratio and are indicated by GenBank accession
numbers.

The name of each sample is indicated on top of each photo (e-h):
NL-64, NL-65, NL-66, NL-67, NL-68, NL-69, IL-49, IL-58, IL-59,
IL-60, IL-62, G1-26T, G1-42T, G1-85T, G1-86T, G1-87T, G1-147T,
G1-165T, G2-1T, G2-2T, G2-6T, G2-8T, G2-10T, G2-12T, G2-16T,
G2-18T, G2-20T, G2-22T, G2-23T, G2-27T, G2-28T, G2-29T, G2-31T,
G2-34T, G2-37T, G2-43T, G2-45T, G2-46T, G2-49T, G2-58T, G2-59T,
G2-60T, G2-62T, G2-89T, G2-90T, G2-105T, G2-151T, G2-155T,
G2-161T, G2-162T, G2-163T, G2-171T, G2-182T, G3-19T, G3-21T,
G3-25T, G3-35T, G3-80T, G3-81T, G3-107T, G3-174T, from the
left.

The name of each gene is indicated on the right of the photo.
In the case of panel e, M18533, AF035316, AL049942, L27479,
"Fibronectin, Alt. Splice 1", U19765, X55503, AL046394,
AB007886, AL050139, AF012086, AI539439, M19828, U92315, D76444,
X02761, AF001891, AI400326, AI362017, L13977, D32053, AF038962,
AL008726, J03909, Z69043, AL080080, M63138, L09159, AF017115,
M13560, M36035, U47101, U81554, M21186, D32129, AL022723,
M83664, U50523, M81757, AF102803, from the top. In the case
of panel f, M93221, AF079221, V01512, D88587, U12022, AF055376,
R93527, R92331, U83460, AF052113, H68340, M10943, M13485,
U75744, X02544, M93311, Z24725, U22961, M62403, M35878, U84011,
AF055030, L13977, D13891, M63175, AB023157, U20982, M14058,
AL049650, U61232, AI991040, U64444, D63997, X55503, AL080181,
X76228, AB018330, D76444, U70660, U10323, from the top. In the
case of panel g, M87434, M12963, AI625844, M97936, Z99129,
L07633, D50312, U07364, AA883502, M97935, AF061258, AB007447,
M97935, W28281, M97935, Y00281, D28118, AF104913, AA675900,
L27706, D32050, M63573, AF014398, X70944, U70671, AA447263,
AB014569, M23115, D38521, X00351, L11672, X82834, AB007963,
U76247, X68560, AB015344, AB018327, AF004430, D14697, AB028449,
from the top. In the case of panel h, AA976838, Z11793, AB002311,
Y18004, AL031230, AF002697, AB014596, U49897, AF070570, M80482,
AI263099, U22961, Z24725, U77594, L34081, M88458, U68723,
X92098, D10040, AB023194, AF001903, X96752, AB006202, M75106,
Y12711, D14662, S87759, Z48199, AF088219, AA453183, D31767,
AB000095, AB006782, M21186, AB002312, U44772, AI541308, Z49107,
U77735, M38449, from the top.

Fig. 2 illustrates the validation of the selected 40 genes in
each transition stage to distinguish the differentiation grade
of HCC.
In each transition, from L0 to L1 (a), from L1 to G1 (b), from
G1 to G2 (c), and from G2 to G3 (d), the minimum distance
classifier was constructed with the samples in two consecutive
differentiation grades as indicated by the red bar (training
samples), and was applied to the samples in the remaining
differentiation grades as indicated by the black bar (test
samples). The resulting classifier classified the test samples
with an accuracy of 92% (a), 98% (b), 84% (c), and 100% (d).

Fig. 3 illustrates the result of analysis by the
self-organizing map (SOM) algorithm of the genes whose
expression changed during the transition from non-cancerous
liver (L0) to pre-cancerous liver (L1), from pre-cancerous
liver (L1) to well differentiated HCC (G1), from well
differentiated HCC (G1) to moderately differentiated HCC (G2),
and from moderately differentiated HCC (G2) to poorly
differentiated HCC (G3).

Fig. 3a illustrates clusters of the samples (Table 1).
Each cell in the SOM grid corresponds to one cluster. The
vectors of neighboring cells are usually located close to each
other.

(m, n), index of the cell located at the m-th row and n-th column.
NL-XX, samples from non-cancerous liver without HCV infection
(L0); IL-XX, samples from HCV-infected pre-cancerous liver
(L1); G1-XXT, samples from well differentiated HCC (G1); G2-XXT,
samples from moderately differentiated HCC (G2); G3-XXT,
samples from poorly differentiated HCC (G3).
The map shows that the samples clearly formed a sigmoid curve
in the order of L0, L1, G1, G2, and G3. G2 samples without vessel
involvement (blue letters) are located close to G1 samples and
G2 samples with vessel involvement (red letters) are located
close to G3 samples.

Fig. 3b illustrates the distance between the neighboring
clusters.
(m, n), index of the cell located at the m-th row and n-th column.
The color of the cells indicates the distance between the
neighboring clusters; a red color means a long distance. The
red cells in the upper area clearly show that the non-tumorous
(non-cancerous and pre-cancerous) liver samples and HCC
samples are relatively far apart in all the selected 40 genes.

Table 1 illustrates clusters of samples profiled to L0, L1, G1,
G2, and G3 as shown in Fig. 3a.

Table 2 illustrates clinicopathologic factors of the HCC used
in the present invention.
Table 3 illustrates top-40 discriminatory genes in L0 and L1.
Table 4 illustrates top-40 discriminatory genes in L1 and G1.
Table 5 illustrates top-40 discriminatory genes in G1 and G2.
Table 6 illustrates top-40 discriminatory genes in G2 and G3.

Best Mode for Carrying out the Invention 

The following examples merely illustrate the preferred
method for identification and use of genes and/or proteins that
are differentially expressed in non-cancerous liver,
pre-cancerous liver, well differentiated HCC, moderately
differentiated HCC, and poorly differentiated HCC.

Hereinbelow, the present invention will be specifically
described using examples; however, it is not to be construed
as being limited thereto.

Example 1. Preparation of human tissues

Fifty patients underwent surgical treatment for HCC at
Yamaguchi University Hospital between May 1997 and August 2000.
Written informed consent was obtained from all patients before
surgery. The study protocol was approved by the Institutional
Review Board for the Use of Human Subjects at the Yamaguchi
University School of Medicine. All of the 50 patients were
seropositive for HCV antibody (HCVAb) and seronegative for
hepatitis B virus surface antigen (HBsAg). A histopathological
diagnosis of HCC was made in all cases after surgery. This
histopathological examination showed that seven patients had
well differentiated HCC (G1), 35 had moderately differentiated
HCC (G2), and the remaining eight had poorly differentiated HCC
(G3). Clinicopathologic factors were determined according to
the International Union against Cancer TNM classification.
Fisher's exact test, Student's t test, and Mann-Whitney's U
test were used to elucidate the differences in
clinicopathologic characteristics among the 3 grades, G1, G2
and G3 HCC. P < 0.05 was considered significant.
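For illustration only, the Mann-Whitney U test named above may be sketched in Python as follows. This is not the software used in the present study. The sketch assumes midranks for tied values and a normal approximation with continuity correction for the two-sided P value; for very small groups an exact P value would normally be preferred. The function name is hypothetical.

```python
import math

def mann_whitney_u(x, y):
    """Return (U, two-sided P) for two independent groups.

    Assumptions of this sketch: ties receive midranks, and P comes from
    the normal approximation with tie and continuity corrections.
    """
    n1, n2 = len(x), len(y)
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    ranks = [0.0] * len(pooled)
    tie_term = 0
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j][0] == pooled[i][0]:
            j += 1
        midrank = (i + 1 + j) / 2          # mean of ranks i+1 .. j
        for k in range(i, j):
            ranks[k] = midrank
        t = j - i
        tie_term += t ** 3 - t
        i = j
    rank_sum_x = sum(r for r, (_, group) in zip(ranks, pooled) if group == 0)
    u1 = rank_sum_x - n1 * (n1 + 1) / 2
    u = min(u1, n1 * n2 - u1)
    n = n1 + n2
    expected = n1 * n2 / 2
    variance = n1 * n2 / 12 * ((n + 1) - tie_term / (n * (n - 1)))
    if variance == 0:
        return u, 1.0
    z = max(abs(u - expected) - 0.5, 0.0) / math.sqrt(variance)
    return u, math.erfc(z / math.sqrt(2))
```

A call such as mann_whitney_u(sizes_g1, sizes_g2), where the two lists hold tumor sizes for two grades, returns the U statistic and an approximate P value that can be compared with the 0.05 threshold.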

Six non-cancerous liver samples were obtained from six
patients who underwent hepatic resection for benign or
metastatic liver tumors, and were confirmed to have
histologically normal livers. They were all seronegative for
both HBsAg and HCVAb. Five HCV-infected liver samples were also
prepared from the non-tumorous areas of five patients with HCC.
All five liver samples were histopathologically diagnosed as
chronic hepatitis or liver cirrhosis. Informed consent in
writing was obtained from all patients before surgery.

Example 2. Clinicopathologic characteristics of HCCs 

Histological examinations showed that, among the 50
HCV-associated HCCs enrolled in this study, seven were well
differentiated HCC (G1), 35 were moderately differentiated HCC
(G2), and the remaining eight were poorly differentiated HCC
(G3) (Table 2). The tumor size of G2 and G3 HCCs was
significantly larger than that of G1 HCC (p=0.0007 and p=0.028,
respectively, by Mann-Whitney's U test). The incidence of
vessel involvement in G2 and G3 HCCs was significantly higher
than that in G1 HCC (p=0.038 by Fisher's exact test). In
parallel to dedifferentiation from G1 to G3, tumor stage tended
to be more advanced (p=0.066 by Fisher's exact test). Thus, each
type of G1, G2, and G3 HCCs enrolled in this study showed
characteristics corresponding to dedifferentiation, i.e.,
tumor size, metastatic potential, and tumor stage, as proposed
by Kojiro (Kojiro, M. Pathological evolution of early
hepatocellular carcinoma, Oncology 62, 43-47 (2002)).
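As a hedged illustration of the Fisher's exact test used for the categorical comparisons above, the following Python sketch computes a two-sided P value for a 2 x 2 table (for example, presence or absence of vessel involvement in two grades). It assumes the common convention of summing the probabilities of all tables that are no more likely than the observed one; the function name is hypothetical and the counts in any call are to be supplied by the user, since the underlying table is not reproduced here.

```python
from math import comb

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact P for the 2 x 2 table [[a, b], [c, d]].

    Convention assumed here: sum the hypergeometric probabilities of every
    table with the same margins that is at most as probable as the
    observed table.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    total = row1 + row2

    def prob(k):
        # Probability that the top-left cell equals k given fixed margins.
        return comb(row1, k) * comb(row2, col1 - k) / comb(total, col1)

    low = max(0, col1 - row2)
    high = min(row1, col1)
    observed = prob(a)
    p = sum(prob(k) for k in range(low, high + 1)
            if prob(k) <= observed * (1 + 1e-9))
    return min(1.0, p)
```

The small relative tolerance guards against floating-point ties between equally probable tables.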

Example 3. Extraction of the RNA from tissues

Pieces of the tissues (about 125 mm³) were suspended in
TRIZOL (Life Technologies, Gaithersburg, USA, Catalog No.
15596-018) or Sepasol-RNA I (Nacalai tesque, Kyoto, Japan,
Catalog No. 306-55) and homogenized twice with a Polytron
(Kinematica, Littau, Switzerland) (5 sec at maximum speed).
After addition of chloroform, the tissue homogenates were
centrifuged at 15,000 x g for 10 min, and aqueous phases, which
contained RNA, were collected. Total cellular RNA was
precipitated with isopropyl alcohol, washed once with 70%
ethanol, and suspended in DEPC-treated water (Life
Technologies, Gaithersburg, USA, Catalog No. 10813-012).
After treatment with 1.5 units of DNase I (Life Technologies,
Gaithersburg, USA, Catalog No. 18068-015), the RNA was
re-extracted with TRIZOL/chloroform, precipitated with
ethanol, and dissolved in DEPC-treated water. Thereafter,
small molecular weight nucleotides were removed by using the
RNeasy Mini Kit (QIAGEN, Hilden, Germany, Catalog No. 74104)
according to the manufacturer's instruction manual. Quality of
the total RNA was judged from the ratio of 28S and 18S ribosomal
RNA after agarose gel electrophoresis. The purified total RNA
was stored at -80 °C in 70% ethanol solution until use.

Example 4. Synthesis of cDNA and labeled cRNA probes 

cDNA was synthesized by using the SuperScript Choice
System (Life Technologies, Gaithersburg, USA, Catalog
No. 18090-019) according to the manufacturer's instruction
manual. Five micrograms of the purified total RNA were
hybridized with oligo-dT primers (Sawady Technology, Tokyo,
Japan) that contained sequences for the T7 promoter, and
incubated with 200 units of SuperScript II reverse
transcriptase at 42 °C for 1 hr. The resulting cDNA was
extracted with phenol/chloroform and purified with Phase Lock
Gel™ Light (Eppendorf, Hamburg, Germany, Catalog No. 0032
005.101).

cRNA was synthesized by using the MEGAscript T7 kit
(Ambion, Austin, USA, Catalog No. 1334) with the cDNA as a
template according to the manufacturer's instructions.
Approximately 5 µg of the cDNA was incubated with 2 µl of enzyme
mix containing T7 polymerase, 7.5 mM each of adenosine
triphosphate (ATP) and guanosine triphosphate (GTP), 5.625 mM
each of cytidine triphosphate (CTP) and uridine triphosphate
(UTP), and 1.875 mM each of Bio-11-CTP and Bio-16-UTP (ENZO
Diagnostics, Farmingdale, USA, Catalog No. 42818 and 42814,
respectively) at 37 °C for 6 hr. Mononucleotides and short
oligonucleotides were removed by column chromatography on a
CHROMA SPIN +STE-100 column (CLONTECH, Palo Alto, USA, Catalog
No. K1302-2), and the cRNA in the eluates was sedimented by
adding ethanol. Quality of the cRNA was judged from the length
of the cRNA after agarose gel electrophoresis. The purified
cRNA was stored at -80 °C in 70% ethanol solution until use.
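The nucleotide concentrations quoted for the labeling reaction can be checked with a short calculation. The Python sketch below is only an arithmetic illustration using the values stated above; the dictionary and function names are hypothetical. It shows that unlabeled plus biotinylated CTP (and likewise UTP) add up to the same 7.5 mM used for ATP and GTP, so that one quarter of each pyrimidine nucleotide pool is biotin-labeled.

```python
# Concentrations in mM, taken from the labeling reaction described above.
UNLABELED = {"ATP": 7.5, "GTP": 7.5, "CTP": 5.625, "UTP": 5.625}
BIOTINYLATED = {"CTP": 1.875, "UTP": 1.875}  # Bio-11-CTP and Bio-16-UTP

def pool_summary(base):
    """Return (total mM, biotin-labeled fraction) for one nucleotide."""
    labeled = BIOTINYLATED.get(base, 0.0)
    total = UNLABELED[base] + labeled
    return total, labeled / total

for base in ("ATP", "GTP", "CTP", "UTP"):
    total, fraction = pool_summary(base)
    print(f"{base}: {total} mM total, {fraction:.0%} biotinylated")
```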

Example 5. Gene expression analysis of HCC in different
differentiation grades

Gene expression of human primary tumors from HCC
patients was examined by high-density oligonucleotide
microarrays (U95A array, Affymetrix, Santa Clara, USA, Catalog
No. 510137) (Lipshutz, R.L. et al. High density synthetic
oligonucleotide arrays, Nat. Genet. 21, 20-24 (1999)). For
hybridization with oligonucleotides on the chips, the cRNA was
fragmented at 95 °C for 35 min in a buffer containing 40 mM Tris
(Sigma, St. Louis, USA, Catalog No. T1503)-acetic acid (Wako,
Osaka, Japan, Catalog No. 017-00256) (pH 8.1), 100 mM potassium
acetate (Wako, Osaka, Japan, Catalog No. 160-03175), and 30 mM
magnesium acetate (Wako, Osaka, Japan, Catalog No. 130-00095).
Hybridization was performed in 200 µl of a buffer containing
0.1 M 2-(N-morpholino)ethanesulfonic acid (MES) (Sigma, St.
Louis, USA, Catalog No. M-3885) (pH 6.7), 1 M NaCl (Nacalai
tesque, Kyoto, Japan, Catalog No. 313-20), 0.01%
polyoxyethylene(10) octylphenyl ether (Wako, Osaka, Japan,
Catalog No. 168-11805), 20 µg herring sperm DNA (Promega,
Madison, USA, Catalog No. D181B), 100 µg acetylated bovine
serum albumin (Sigma, St. Louis, USA, Catalog No. B-8894),
10 µg of the fragmented cRNA, and biotinylated-control
oligonucleotides, biotin-5'-CTGAACGGTAGCATCTTGAC-3' (Sawady
Technology, Tokyo, Japan), at 45 °C for 12 hr. After washing
the chips with a buffer containing 0.01 M MES (pH 6.7), 0.1 M
NaCl, and 0.001% polyoxyethylene(10) octylphenyl ether, the
chips were incubated with biotinylated anti-streptavidin
antibody (Funakoshi, Tokyo, Japan, Catalog No. BA0500) and
stained with streptavidin R-Phycoerythrin (Molecular Probes,
Eugene, USA, Catalog No. S-866) to increase hybridization
signals as described in the instruction manual (Affymetrix,
Santa Clara, USA). Each pixel level was collected with a laser
scanner (Affymetrix, Santa Clara, USA), and levels of the
expression of each cDNA and reliability (Present/Absent call)
were calculated with Affymetrix GeneChip ver. 3.3 and
Affymetrix Microarray Suite ver. 4.0 software. From these
experiments, expression of approximately 11,000 genes in the
human primary tumors of HCC patients was determined.

Example 6. Statistical analysis of the oligonucleotide
microarray data

Genes with average differences greater than 40
(arbitrary units by Affymetrix) in all the 50 HCC samples and
the 11 non-tumorous (non-cancerous and pre-cancerous) liver
samples were selected. This procedure yielded 3,559 genes out
of approximately 11,000. Next, the Fisher ratio was determined
(Iizuka, N., Oka, M., Yamada-Okabe, H., Mori, N., Tamesa, T.,
Okada, T., Takemoto, T., Tangoku, A., Hamada, K., Nakayama, H.,
Miyamoto, T., Uchimura, S., and Hamamoto, Y. Comparison of gene
expression profiles between hepatitis B virus- and hepatitis
C virus-infected hepatocellular carcinoma by oligonucleotide
microarray data based on a supervised learning method, Cancer
Res. 62, 3939-3944 (2002) and Luo, J., Duggan, D.J., Chen, Y.,
Sauvageot, J., Ewing, C.M., Bittner, M.L., Trent, J.M., and
Isaacs, W.B. Human prostate cancer and benign prostatic
hyperplasia: molecular dissection by gene expression profiling,
Cancer Res. 61, 4683-4688 (2001)) to evaluate these genes as
discriminators of L0 from L1, L1 from G1, G1 from G2, and G2
from G3. The above 3,559 genes were ranked in the order of
decreasing magnitude of the Fisher ratio. A random permutation
test was also performed to determine the number of genes to
define the differentiation grade of HCC. The random
permutation test was carried out as described previously
(Iizuka, N., Oka, M., Yamada-Okabe, H., Mori, N., Tamesa, T.,
Okada, T., Takemoto, T., Tangoku, A., Hamada, K., Nakayama, H.,
Miyamoto, T., Uchimura, S., and Hamamoto, Y. Comparison of gene
expression profiles between hepatitis B virus- and hepatitis
C virus-infected hepatocellular carcinoma by oligonucleotide
microarray data based on a supervised learning method, Cancer
Res. 62, 3939-3944 (2002) and Luo, J., Duggan, D.J., Chen, Y.,
Sauvageot, J., Ewing, C.M., Bittner, M.L., Trent, J.M., and
Isaacs, W.B. Human prostate cancer and benign prostatic
hyperplasia: molecular dissection by gene expression profiling,
Cancer Res. 61, 4683-4688 (2001)). In the test, sample labels
were randomly permuted between the two grades to be considered,
and the Fisher ratio for each gene was again computed. This
random permutation of sample labels was repeated 1,000 times.
The Fisher ratios generated from the actual data were then
assigned P values based on the distribution of the Fisher
ratios from randomized data. From the distribution of the
Fisher ratios based on the randomized data, all of the genes
that could pass the random permutation test (P < 0.005) were
selected. This procedure was performed in all experiments for
the comparison of two grades. As a result, 152 genes with
Fisher ratios higher than 4.90 were statistically significant
discriminators between L0 and L1. Likewise, 191 genes with
Fisher ratios higher than 4.08 to discriminate L1 from G1, 54
genes with Fisher ratios higher than 1.52 to discriminate G1
from G2, and 40 genes with Fisher ratios higher than 1.34 to
discriminate G2 from G3 were identified.
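The selection procedure of this example (expression filter, ranking by Fisher ratio, and random permutation of sample labels) may be sketched in Python as follows. The sketch is illustrative and rests on stated assumptions: the Fisher ratio is taken in one common form, the squared difference of the two class means divided by the sum of the two class variances, whereas the cited references may weight the variances differently; the P value is computed per gene from that gene's own permutation distribution, which is a simplification; and all function names are hypothetical.

```python
import random
from statistics import mean, pvariance

def passes_filter(values, threshold=40):
    """True when the average difference exceeds the threshold in every sample."""
    return all(v > threshold for v in values)

def fisher_ratio(a, b):
    """Assumed form: (mean_a - mean_b)^2 / (var_a + var_b)."""
    gap = (mean(a) - mean(b)) ** 2
    spread = pvariance(a) + pvariance(b)
    if spread == 0:
        return float("inf") if gap > 0 else 0.0
    return gap / spread

def rank_genes(expr_a, expr_b):
    """Gene names in order of decreasing Fisher ratio.

    expr_a and expr_b map a gene name to its expression values in the
    two grades being compared.
    """
    return sorted(expr_a,
                  key=lambda g: fisher_ratio(expr_a[g], expr_b[g]),
                  reverse=True)

def permutation_p(a, b, n_perm=1000, seed=0):
    """Fraction of label permutations whose Fisher ratio reaches the observed one."""
    rng = random.Random(seed)
    observed = fisher_ratio(a, b)
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)               # random reassignment of sample labels
        if fisher_ratio(pooled[:len(a)], pooled[len(a):]) >= observed:
            hits += 1
    return hits / n_perm
```

Under these assumptions, a gene would be retained for a given pair of grades when passes_filter is true for its values and permutation_p falls below the chosen cutoff (0.005 in this example).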

Example 7. Selection of genes whose expression correlates with
differentiation grade of HCC

With oligonucleotide array data, changes in the gene
expression during oncogenesis, i.e., from non-cancerous liver
(L0) to HCV-infected pre-cancerous liver (L1) and from L1 to
well differentiated HCC (G1), and during dedifferentiation of
HCC (G1 to G2 and G2 to G3) were analyzed. The supervised
learning method followed by a random permutation test
identified 152 genes whose expression level was significantly
changed during the transition from L0 to L1. Among the 152 genes,
67 were upregulated and 85 were downregulated during this
transition. In the same manner, 191 genes whose expression
level was significantly changed during the transition from L1
to G1 HCC were identified. Among the 191 genes, 95 were
upregulated and 96 were downregulated during this transition.
Fifty-four genes appeared to be differentially expressed
between G1 and G2 HCCs, and among them the expression of 36 genes
was increased and that of 18 genes was decreased during the
transition from G1 to G2. Forty genes turned out to be
differentially expressed between G2 and G3 HCCs, and among them
the expression of 10 genes was increased and that of 30 genes
was decreased during the transition from G2 to G3.
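The tallies of upregulated and downregulated genes can be sketched as follows. The sketch assumes that the direction of change is judged from the difference of the mean expression levels before and after a transition, which the text does not state explicitly; the function names are hypothetical. The REPORTED table simply restates the counts given above so that their sums can be checked.

```python
from statistics import mean

# (upregulated, downregulated, total) as reported for each transition.
REPORTED = {
    "L0-L1": (67, 85, 152),
    "L1-G1": (95, 96, 191),
    "G1-G2": (36, 18, 54),
    "G2-G3": (10, 30, 40),
}

def direction(before, after):
    """'up' when the mean level rises across the transition, else 'down'."""
    return "up" if mean(after) > mean(before) else "down"

def count_directions(profiles):
    """profiles maps gene name -> (values before, values after)."""
    counts = {"up": 0, "down": 0}
    for before, after in profiles.values():
        counts[direction(before, after)] += 1
    return counts

for name, (up, down, total) in REPORTED.items():
    assert up + down == total, name   # reported counts are self-consistent
```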

To examine performance of the genes selected in each
grade in the oncogenesis and development of HCC, the inventors
applied data of these genes to all samples. As a result, almost
all of these genes selected in each transition stage were placed
in L0-L1 transition, L1-G1 transition, G1-G2 transition, and
G2-G3 transition. For example, the 191 genes that discriminate
L1 from G1 HCC could clearly distinguish non-tumorous livers
(L0 and L1) from HCCs (G1, G2, and G3) (Fig. 1). These results
indicate that altered levels of the selected genes play central
roles in determining each grade of HCC pathogenesis.

Example 8. Genes whose expression changed during the transition
from non-cancerous liver (L0) to pre-cancerous liver (L1)

Expression of most of immune response-related genes,
metabolism-related genes, transport-related genes,
proteolysis-related genes, and oncogenesis-related genes was
increased, and that of transcription-related genes was
decreased during the transition from L0 to L1 (Table 3).

Immune response-related genes include MHC class I
family (HLA-A, -C, -E, and -F), MHC class II family (HLA-DPB1
and HLA-DRA), CD74, NK4, LILRB1, FCGR3B, and IFI30.
Upregulation of an interferon (IFN) inducible gene such as
IFI30 may represent host defense against viral infection;
however, it should be noted that several IFN-related genes were
decreased during dedifferentiation of G1 to G2 as mentioned in
the following section (see Example 10).

Metabolism-related genes include KARS, ALDOA, ASAH, MPI,
and GAPD. Increased levels of KARS and ALDOA enhance protein
biosynthesis and glycolysis, respectively. Upregulation of
ASAH, MPI, and GAPD augments biosynthesis of fatty acid,
mannose, and glyceraldehyde, respectively.

Transport-related genes include VDAC3, SSR4, BZRP, and
ATOX1. SSR4 is responsible for the effective transport of newly
synthesized polypeptides. ATOX1 is a copper transporter and
an increase in its expression causes activation of various
metabolic pathways, because many enzymes require copper ion as
a cofactor of enzymatic activity.

Proteolysis-related genes include CST3 and CTSD. CST3
is involved in vascular formation. Increased serum level of
CTSD protein was observed in cirrhotic patients who may develop
pre-cancerous hepatic nodules (Leto, G., Tumminello, F.M.,
Pizzolanti, G., Montalto, G., Soresi, M., Ruggeri, I., and
Gebbia, N. Cathepsin D serum mass concentrations in patients
with hepatocellular carcinoma and/or liver cirrhosis, Eur. J.
Clin. Chem. Clin. Biochem. 34, 555-560 (1996)).

Oncogenesis-related genes include MBD2, RPS19, RPS3,
RPS15, and RPS12. DNA methylation is a common epigenetic change
in many malignancies, and DNA methylation patterns are
determined by the enzymatic processes of methylation and
demethylation. Upregulation of MBD2, which inhibits
transcription from methylated DNA, plays an important role in
downregulation of tumor suppressor genes carrying methylated
DNA at their promoter regions.

Downregulation of a transcription-related gene, RB1CC1,
was observed during the transition from L0 to L1. The RB1CC1
protein is a major regulator of the tumor suppressor gene RB1;
thus, decreased levels of RB1CC1 can promote oncogenesis via
decreased activity of RB1 protein.

Thus, HCV-infected pre-cancerous liver is
characterized by the altered expression of these genes, which
suggests that initiation of hepatocarcinogenesis occurs during
HCV infection. Among genes whose expression changes during the
transition from L0 to L1, those involved in proteolysis and
oncogenesis may serve as molecular targets for chemoprevention
of HCV-associated HCC.

Example 9. Genes whose expression changed during the transition
from pre-cancerous liver (L1) to well differentiated HCC (G1)

Genes whose expression was altered during the
transition from L1 to G1 include most oncogenesis-related genes,
signal transduction-related genes, transcription-related
genes, transport-related genes, detoxification-related genes,
and immune response-related genes (Table 4).

Oncogenesis-related genes such as BNIP3L, FOS, MAF, and
IGFBP3 that can induce apoptosis of some cancer cells and IGFBP4
that acts as an inhibitor of IGF-induced cell proliferation
were downregulated during the transition, indicating that
downregulation of these genes is also important for the
promotion of hepatocarcinogenesis. Previous reports also
showed the decreased expression of IGFBP3 and IGFBP4 in HCC
compared with non-tumorous liver (Okabe, H., Satoh, S., Kato,
T., Kitahara, O., Yanagawa, R., Yamaoka, Y., Tsunoda, T.,
Furukawa, Y., and Nakamura, Y. Genome-wide analysis of gene
expression in human hepatocellular carcinomas using cDNA
microarray: identification of genes involved in viral
carcinogenesis and tumor progression, Cancer Res. 61,
2129-2137 (2001) and Delpuech, O., Trabut, J.B., Carnot, F.,
Feuillard, J., Brechot, C., and Kremsdorf, D. Identification,
using cDNA macroarray analysis, of distinct gene expression
profiles associated with pathological and virological features
of hepatocellular carcinoma, Oncogene 21, 2926-2937 (2002)).
The data of the present invention provide the additional
insight that downregulation of these two genes has already
occurred in well differentiated HCC. MAF functions as a
regulator of cell differentiation. BNIP3L induces cell
apoptosis via inhibiting activity of BCL2. In some cases,
expression of FOS seems to be associated with apoptotic cell
death. Thus, downregulation of these five genes is likely to
trigger the transformation of hepatocytes after chronic HCV
infection.

Signal transduction-related genes such as CAMKK2, GMFB,
RALBP1, CDIPT, ZNF259, and RAC1, and transcription-related
genes such as DRAP1, ILF2, BMI1, and PMF1 were upregulated
during the transition from L1 to G1. Other signal
transduction-related genes such as CALM1, RAB14, TYROBP, and
MAP2K1 were downregulated during this transition.
Downregulation of TYROBP in G1 HCC may reflect decreased immune
response. Alteration of the expression of genes involved in
various signal transduction pathways may reflect a true
portrait of well differentiated HCC arising from HCV-infected
pre-cancerous liver.

Transport-related genes such as TBCE, ATP6V1E, ATOX1,
and SEC61G were upregulated, and those such as SLC31A1 and DDX19
were downregulated during the transition from L1 to G1. ATOX1,
an intracellular copper transporter, was upregulated during the
transition from L0 to L1, and it was further upregulated during
the transition from L1 to G1. Since excessive copper is toxic
or even lethal to hepatocytes, altered expression of the ATOX1
gene changes intracellular copper ion concentrations and
thereby promotes DNA damage and cell injury. In fact, a recent
study showed the preventive effect of copper-chelating agents
on tumor development in the murine HCC xenograft model (Yoshii,
J., Yoshiji, H., Kuriyama, S., Ikenaka, Y., Noguchi, R., Okuda,
H., Tsujinoue, H., Nakatani, T., Kishida, H., Nakae, D., Gomez,
D.E., De Lorenzo, M.S., Tejera, A.M., and Fukui, H. The
copper-chelating agent, trientine, suppresses tumor
development and angiogenesis in the murine hepatocellular
carcinoma cells, Int. J. Cancer 94, 768-773 (2001)).

DNA damage and cell injury can be augmented by the
downregulation of an antioxidant gene CAT and
detoxification-related genes such as MT1H, MT1E, MT1F, MT1B,
MT3, and UGT2B7, promoting the dedifferentiation of HCC.

Using anti-hyaluronan receptor-1 antibody, Carreira et
al. showed that the number of lymphatic vessels was smaller in
HCC than in non-tumorous liver tissues such as liver cirrhosis
(Mouta Carreira, C., Nasser, S.M., di Tomaso, E., Padera, T.P.,
Boucher, Y., Tomarev, S.I., and Jain, R.K. LYVE-1 is not
restricted to the lymph vessels: expression in normal liver
blood sinusoids and down-regulation in human liver cancer and
cirrhosis, Cancer Res. 61, 8079-8084 (2001)). In the present
invention, expression of immune response-related genes such as
ORM1, C1R, C6, IL4R, C8B, and C1S was decreased during the
transition from L1 to G1, indicating that changes in the
microenvironment in HCC occur during the transition from L1 to
G1. As reported previously, many genes encoding complement
components were downregulated during this transition (Okabe,
H., Satoh, S., Kato, T., Kitahara, O., Yanagawa, R., Yamaoka,
Y., Tsunoda, T., Furukawa, Y., and Nakamura, Y. Genome-wide
analysis of gene expression in human hepatocellular carcinomas
using cDNA microarray: identification of genes involved in
viral carcinogenesis and tumor progression, Cancer Res. 61,
2129-2137 (2001) and Iizuka, N., Oka, M., Yamada-Okabe, H.,
Mori, N., Tamesa, T., Okada, T., Takemoto, T., Tangoku, A.,
Hamada, K., Nakayama, H., Miyamoto, T., Uchimura, S., and
Hamamoto, Y. Comparison of gene expression profiles between
hepatitis B virus- and hepatitis C virus-infected
hepatocellular carcinoma by oligonucleotide microarray data
based on a supervised learning method, Cancer Res. 62,
3939-3944 (2002)).

Example 10. Genes whose expression changed during the
transition from well differentiated HCC (G1) to moderately
differentiated HCC (G2)

Genes whose expression was altered during the
transition from G1 to G2 include IFN-related genes, cell
structure and motility-related genes, transcription-related
genes, and tumor suppressor genes (Table 5).

During the transition from G1 to G2, the most prominent
genetic changes appeared to be downregulation of IFN-related
genes such as OAS2, STAT1, PSME1, ISGF3G, and PSMB9. Similar
genetic changes were also observed in prostate cancer cells
(Shou, J., Soriano, R., Hayward, S.W., Cunha, G.R., Williams,
P.M., and Gao, W.Q. Expression profiling of a human cell line
model of prostatic cancer reveals a direct involvement of
interferon signaling in prostate tumor progression, Proc. Natl.
Acad. Sci. U.S.A. 99, 2830-2835 (2002)). IFN acts not only as
an antiviral agent but also as an anticancer agent; however,
certain types of HCC do not respond to IFN. Downregulation of
the IFN-related genes can attenuate the response of tumor cells
to IFN, suggesting that resistance of HCC to IFN develops
during the transition from G1 to G2. Among the IFN-related
genes, STAT1 appeared four times in our list of discriminators
of G1 from G2 (Table 5). Unlike other genes of the same family,
STAT1 functions as a tumor suppressor (Bromberg, J.F.
Activation of STAT proteins and growth control, Bioessays 23,
161-169 (2001)). Interestingly, IFN treatment increases the
expression of STAT1 as well as many IFN-related genes in
hepatocytes (Radaeva, S., Jaruga, B., Hong, F., Kim, W.H., Fan,
S., Cai, H., Strom, S., Liu, Y., El-Assal, O., and Gao, B.
Interferon-alpha activates multiple STAT signals and
down-regulates c-Met in primary human hepatocytes,
Gastroenterology 122, 1020-1034 (2002)). Upregulation of
STAT1 in HCC cell lines was observed during differentiation
induced by sodium butyrate (Hung, W.C. and Chuang, L.Y. Sodium
butyrate enhances STAT 1 expression in PLC/PRF/5 hepatoma cells
and augments their responsiveness to interferon-alpha, Br. J.
Cancer 80, 705-710 (1999)). The facts that STAT1 is a
transcriptional target of the IGF-independent apoptotic effect
of IGFBP3 (Spagnoli, A., Torello, M., Nagalla, S.R., Horton,
W.A., Pattee, P., Hwa, V., Chiarelli, F., Roberts, C.T. Jr.,
and Rosenfeld, R.G. Identification of STAT-1 as a molecular
target of IGFBP-3 in the process of chondrogenesis, J. Biol.
Chem. 277, 18860-18867 (2002)) and that IGFBP3 is downregulated
during the transition from L1 to G1 strongly suggest that
decreased expression of STAT1 during the transition from G1 to
G2 HCC facilitates the further dedifferentiation of HCC.

Transcription-related gene TRIM16 that is involved in
a variety of biological processes including cell growth,
differentiation, and pathogenesis, and tumor suppressor gene
TPD52L2 that promotes cell proliferation were also upregulated
during the transition from G1 to G2. Upregulation of these
genes in G2 HCC may promote growth and invasion of tumor cells.

Example 11. Genes whose expression changed during the
transition from moderately differentiated HCC (G2) to poorly
differentiated HCC (G3)

Genes whose expression was altered during the
transition from G2 to G3 include proteolysis-related genes,
BCL2-related gene, and metabolism- and energy
generation-related genes (Table 6).

SPINT1 and LGALS9 turned out to be upregulated during
the transition from G2 to G3. SPINT1 is involved in regulation
of proteolytic activation of hepatocyte growth factor (HGF) in
injured tissues. Previously, Nagata et al. showed that
transduction of antisense SPINT1 (HAI-1) inhibited the growth
of human hepatoma cells, suggesting that SPINT1 plays an
important role in the progression of HCC (Nagata, K., Hirono,
S., Ido, A., Kataoka, H., Moriuchi, A., Shimomura, T., Hori,
T., Hayashi, K., Koono, M., Kitamura, N., and Tsubouchi, H.
Expression of hepatocyte growth factor activator and
hepatocyte growth factor activator inhibitor type 1 in human
hepatocellular carcinoma, Biochem. Biophys. Res. Commun. 289,
205-211 (2001)). LGALS9 belongs to a lectin family that is
involved in cell adhesion, cell growth regulation,
inflammation, immunomodulation, apoptosis, and metastasis.
Several galectins are thought to be related to cancer cell
adhesion (Ohannesian, D.W., Lotan, D., Thomas, P., Jessup, J.M.,
Fukuda, M., Gabius, H.J., and Lotan, R. Carcinoembryonic
antigen and other glycoconjugates act as ligands for galectin-3
in human colon carcinoma cells, Cancer Res. 55, 2191-2199
(1995)).

BNIP3, a BCL2-related gene, was downregulated during
the transition from G2 to G3. BNIP3 shares 56% amino acid
sequence identity with BNIP3L. As mentioned above, expression
of BNIP3L was decreased during the transition from L1 to G1.
Because BCL2 functions as an anti-apoptotic factor,
downregulation of BNIP3L and BNIP3 promotes oncogenesis,
facilitating the dedifferentiation of tumor cells.

Many metabolism- and energy generation-related genes
were also downregulated during this transition. In addition,
expression of PGRMC1, which encodes a liver-rich protein that
binds to progesterone, and of RARRES2 was also decreased during
the transition from G2 to G3. Decreased expression of RARRES2
may be the cause of the poor response of G3 HCC to retinoic
acids.

Example 12. Color display of the expression of the selected
genes in each transition stage

Expression of 152 genes whose expression was
significantly altered during the transition from L0 to L1 (Fig.
1a), 191 genes whose expression was significantly altered
during the transition from L1 to G1 (Fig. 1b), 54 genes whose
expression was significantly altered during the transition
from G1 to G2 (Fig. 1c), and 40 genes whose expression was
significantly altered during the transition from G2 to G3 (Fig.
1d) was shown by color display. These genes clearly
distinguished the samples in the two consecutive
differentiation grades. Fig. 1e-h indicate the expression of
the selected 40 genes in each transition stage in all the
samples. Expression of the selected 40 genes whose expression
was significantly altered during the transition from L0 to L1
(Fig. 1e), from L1 to G1 (Fig. 1f), from G1 to G2 (Fig. 1g),
and from G2 to G3 (Fig. 1h) was also shown by color display.
The selected 40 genes in each transition stage discriminated
samples before and after the transition.

Example 13. Validation of the selected 40 genes in each
transition stage to distinguish the differentiation grade of
HCC

To validate discriminative performance of the selected
40 genes in each transition stage, the minimum distance
classifier with the selected 40 genes in each transition stage
was created. In each transition, the minimum distance
classifier was constructed with the samples in the two
consecutive differentiation grades as indicated by the red bar
(training samples), and was applied to the samples in the
remaining differentiation grades as indicated by the black bar
(test samples) (Fig. 2). The resulting classifier classified
the test samples with the accuracy of 92% (Fig. 2a), 98% (Fig.
2b), 84% (Fig. 2c), and 100% (Fig. 2d).
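A minimum distance classifier of the kind referred to above may be sketched in Python as follows. The sketch assumes Euclidean distance to the mean expression vector (centroid) of each training grade, which is the usual meaning of the term but is not spelled out in this example; how the expected side of a transition is assigned to each test sample is likewise left to the caller. All names are hypothetical.

```python
import math

def centroid(vectors):
    """Component-wise mean of a list of equal-length expression vectors."""
    return [sum(column) / len(vectors) for column in zip(*vectors)]

def train(training):
    """training maps a grade label to the expression vectors of that grade."""
    return {label: centroid(vectors) for label, vectors in training.items()}

def classify(model, vector):
    """Assign the label whose centroid is nearest in Euclidean distance."""
    return min(model, key=lambda label: math.dist(vector, model[label]))

def accuracy(model, labelled_test):
    """labelled_test is a list of (expected label, expression vector) pairs."""
    correct = sum(1 for expected, vector in labelled_test
                  if classify(model, vector) == expected)
    return correct / len(labelled_test)
```

In the setting of this example each vector would hold the expression levels of the 40 selected genes, the model would be trained on the two consecutive grades of one transition, and accuracy would be evaluated on the samples of the remaining grades.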

Example 14. Analysis by the self-organizing map (SOM) algorithm
of the genes whose expression changed during the transition
from non-cancerous liver (L0) to pre-cancerous liver (L1), from
pre-cancerous liver (L1) to well differentiated HCC (G1), from
well differentiated HCC (G1) to moderately differentiated HCC
(G2), and from moderately differentiated HCC (G2) to poorly
differentiated HCC (G3)

Expression of the genes whose expression was 
statistically significantly different between non-cancerous 

20 liver (L0) and pre-cancerous liver (Ll) , pre-cancerous liver 
(Ll) and well differentiated HCC (Gl) , well differentiated HCC 
(Gl) and moderately differentiated HCC (G2) , moderately 
differentiated HCC (G2) and poorly differentiated HCC (G3) was 
analysed according to the method of MAT LAB R13 with the SOM 

25 toolbox available in the web site, 

http://www.cis.hut.fi/projects/somtoolbox/ (Kohonen, 2001) . 
The 40 genes selected in each comparison between non-cancerous
liver (L0) and pre-cancerous liver (L1), pre-cancerous liver (L1)
and well differentiated HCC (G1), well differentiated HCC (G1)
and moderately differentiated HCC (G2), and moderately
differentiated HCC (G2) and poorly differentiated HCC (G3) were
used. The vectors of neighboring cells were located close to each
other in the 155-dimensional gene space (Fig. 3a), where (m, n)
indicates the cell located at the m-th row and n-th column, NL-XX
indicates samples from non-cancerous liver without HCV
infection (L0), IL-XX indicates samples from HCV-infected
pre-cancerous liver (L1), G1-XXT indicates samples from well
differentiated HCC (G1), G2-XXT indicates samples from
moderately differentiated HCC (G2), and G3-XXT indicates samples
from poorly differentiated HCC (G3). The map showed that
the samples clearly formed a sigmoid curve in the order of L0,
L1, G1, G2, and G3. G2 samples without venous invasion (blue
letters) were located close to G1 samples, and G2 samples with
venous invasion (red letters) were located close to G3
samples (Fig. 3a). Thus, the SOM classified G2 samples into two
subtypes, i.e., tumors with venous invasion and those without
venous invasion, along the course of dedifferentiation. When
the distance between neighboring clusters was shown by colors,
where red indicated a long distance, the red cells in the upper
area clearly demonstrated that the non-tumorous (non-cancerous
and pre-cancerous) liver samples and the HCC samples were
relatively far apart in the 155-dimensional gene space (Fig. 3b).
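
The mapping of samples onto SOM cells can be illustrated with a small, self-contained Python sketch. It is not the MATLAB SOM toolbox used in this Example: the linear decay schedules, the Gaussian neighborhood, the grid size, the function names, and the two-dimensional toy samples are all assumptions chosen for brevity. Cells are reported as (row, column) counted from 1, matching the (m, n) notation above.

```python
import math
import random

def best_matching_unit(weights, x):
    """Return the grid cell (row, col) whose weight vector is closest to x."""
    return min(weights, key=lambda cell: math.dist(weights[cell], x))

def train_som(samples, n_rows, n_cols, epochs=50, lr0=0.5, sigma0=2.0, seed=0):
    """Train a rectangular SOM; returns a dict mapping (row, col) to a weight vector."""
    rng = random.Random(seed)
    dim = len(samples[0])
    weights = {(r, c): [rng.random() for _ in range(dim)]
               for r in range(n_rows) for c in range(n_cols)}
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)                  # learning rate decays linearly
        sigma = max(sigma0 * (1.0 - frac), 0.5)  # neighborhood radius shrinks
        for x in samples:
            br, bc = best_matching_unit(weights, x)
            for (r, c), w in weights.items():
                grid_d2 = (r - br) ** 2 + (c - bc) ** 2
                h = math.exp(-grid_d2 / (2.0 * sigma ** 2))
                for i in range(dim):
                    # Pull each unit toward x, weighted by grid proximity to the winner.
                    w[i] += lr * h * (x[i] - w[i])
    return weights

# Hypothetical toy samples (2 features instead of 155 genes).
samples = {
    "NL-a": [0.0, 0.1], "NL-b": [0.1, 0.0],
    "G3-a": [1.0, 0.9], "G3-b": [0.9, 1.0],
}
weights = train_som(list(samples.values()), n_rows=3, n_cols=2)
cells = {name: tuple(i + 1 for i in best_matching_unit(weights, x))
         for name, x in samples.items()}
```

After training, `cells` lists for every sample the cell it falls into, which is the kind of assignment summarized in Table 1.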

Industrial Applicability 

Hepatocellular carcinoma (HCC) is one of the most common
cancers worldwide. However, there is no therapy that can cure
the disease. This is presumably due to sequential changes in
the characteristics of cancer cells during the development and
progression of the disease. In particular, progression of
cancer is often associated with changes in the differentiation
grade of tumor cells. Diagnosis and management of such changes
in cancer cells will make cancer therapy more effective. In
the present invention, genes whose expression correlates with
oncogenesis and development of HCC are identified. A
supervised learning method followed by a random permutation
test is used to select genes whose expression changes
significantly during the transition from non-cancerous liver
without HCV infection (L0) to pre-cancerous liver with HCV
infection (L1), from L1 to well differentiated HCC (G1), from G1
to moderately differentiated HCC (G2), and from G2 to poorly
differentiated HCC (G3). The minimum distance classifier and
the self-organizing map (SOM) with the selected 40 genes whose
expression is significantly altered in each transition stage
can correctly predict the differentiation grade of tumor
tissues. Thus, these genes can be used for diagnosing the
differentiation grade of HCC and for screening anti-cancer
agents for the treatment of HCC in each differentiation grade.
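
The idea of a random permutation test for one gene can be sketched in Python as follows. The score used for gene selection is not restated in this section, so the absolute difference in group means is an assumed stand-in, and the function names and expression values are hypothetical.

```python
import random

def mean_diff(a, b):
    """Absolute difference between the means of two groups."""
    return abs(sum(a) / len(a) - sum(b) / len(b))

def permutation_p_value(a, b, n_perm=2000, seed=0):
    """Estimate how often a random relabeling gives a score at least as large."""
    rng = random.Random(seed)
    observed = mean_diff(a, b)
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)  # random reassignment of grade labels
        if mean_diff(pooled[:len(a)], pooled[len(a):]) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return (hits + 1) / (n_perm + 1)

# Hypothetical expression values of one gene in two consecutive grades.
grade_a = [5.1, 5.3, 4.9, 5.2, 5.0]
grade_b = [1.0, 1.2, 0.9, 1.1, 0.8]
p_value = permutation_p_value(grade_a, grade_b)
```

A gene would be retained under this sketch when its estimated p-value falls below a chosen threshold; the threshold itself is a design choice that this example does not fix.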

Table 1. Clusters of samples profiled to L0, L1, G1, G2, and G3.

cell    sample
(1,1)   IL-49, IL-58, IL-59, IL-60, IL-62
(1,2)
(1,3)   NL-64, NL-65, NL-68, NL-69
(1,4)   NL-66, NL-67
(1,5)
(2,1)
(2,2)   G2-34T
(2,3)
(2,4)
(2,5)   G2-16T, G2-29T, G2-45T
        G2-2T
(3,1)   G1-85T, G1-87T
(3,2)
(3,3)   G1-42T
        G2-22T
(3,4)
(3,5)
(4,1)   G1-86T
        G2-105T
(4,2)   G1-26T
(4,3)
(4,4)   G2-8T, G2-27T
(4,5)   G2-151T
(5,1)   G1-147T, G1-165T
(5,2)
(5,3)   G2-60T
(5,4)   G2-18T
(5,5)   G2-31T
        G2-20T, G2-59T
(6,1)   G3-21T
(6,2)   G3-80T
(6,3)   G2-1T, G2-163T
        G2-161T
(6,4)   G2-28T, G2-155T
(6,5)   G2-90T
(7,1)   G3-107T
(7,2)   G3-25T
(7,3)   G2-46T, G2-62T, G2-171T
        G2-162T
(7,4)
(7,5)   G2-37T
        G2-6T, G2-58T
(8,1)   G3-35T, G3-81T, G3-174T
(8,2)   G2-49T
        G2-23T
        G2-12T
(8,3)   G2-10T
        G3-19T
(8,4)   G2-89T
(8,5)   G2-43T, G2-182T

Table 2. Clinicopathologic characteristics per study group.

Factors                          Well (G1)   Moderately (G2)   Poorly (G3)   P value
Sex                                                                          N.S.
  Male                           4           24                6
  Female                         3           11                2
Age (year)                       65.3±2.6    65.4±1.2          67.2±3.3      N.S.
Primary lesion                                                               N.S.
  Single tumor                   6           15                2
  Multiple tumors                1           20                6
Tumor size (cm)                  2.0±0.3     5.0±0.5           6.0±2.5       p=0.0007 (G1 vs G2); [illegible] (G1 vs G3)
Stage*                                                                       p=0.066
  I                              6           10                2
  II                             1           17                3
  IIIA/IV                        0           8                 3
Venous invasion*
  (-)                            7           21                3
  (+)                            0           14                5
Non-tumorous liver                                                           N.S.
  Normal or chronic hepatitis    2           15                2
  Liver cirrhosis                5           20                6

*, Tumor differentiation, stage, and venous invasion were
determined on the basis of the TNM classification of UICC.
Fisher's exact test, Student's t test, and Mann-Whitney's U
test were used to elucidate the differences in backgrounds
between each differentiation grade.
N.S., not significant.
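
For illustration, a two-sided Fisher's exact test on a 2x2 table can be computed with the Python standard library as sketched below. The example uses the venous invasion counts of the G1 and G3 columns of Table 2; this pairwise comparison is an assumption made for the example, since the table does not state which comparisons were tested for this factor, and the function name is hypothetical.

```python
from math import comb

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability that the top-left cell equals x.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    lo = max(0, col1 - row2)
    hi = min(row1, col1)
    p_obs = prob(a)
    # Sum the probabilities of all tables no more likely than the observed one.
    return sum(p for p in (prob(x) for x in range(lo, hi + 1))
               if p <= p_obs * (1 + 1e-9))

# Rows: venous invasion (-) and (+); columns: G1 and G3 (counts from Table 2).
p_value = fisher_exact_two_sided(7, 3, 0, 5)  # equals 11/429, about 0.026
```

The small tolerance in the comparison guards against floating-point ties between equally likely tables.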